Method and apparatus for generating a sequence of clock signals

ABSTRACT

A clock generator circuit generates a sequence of clock signals equally phased from each other from a master clock signal. The clock generator is formed by inner and outer delay-locked loops. The inner delay-locked loop includes a voltage controlled delay line that delays a reference clock applied to its input by a plurality of respective delays. Two of the clock signals in the sequence are applied to a phase detector so that the signals at the outputs of the delay line have predetermined phases relative to each other. The outer delay-locked loop is formed by a voltage controlled delay circuit that delays the command clock by a voltage controlled delay to provide the reference clock to the delay line of the inner delay-locked loop. The outer delay-locked loop also includes a phase detector that compares the command clock to one of the clock signals in the sequence generated by the delay line. The outer delay-locked loop thus locks one of the clock signals in the sequence to the command clock. As a result, all of the clock signals in the sequence generated by the delay line have respective predetermined phases relative to the phase of the command clock. One of the clock signals in the sequence is selected by a multiplexer to clock a command data latch at a time corresponding to the delay in coupling a command data bit to the latch.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 11/897,837, filed Aug. 31, 2007, which application is a continuation of U.S. application Ser. No. 11/182,965, filed Jul. 15, 2005 and issued on Aug. 19, 2008 as U.S. Pat. No. 7,415,404, which application is a continuation of U.S. application Ser. No. 09/758,970, filed Jan. 9, 2001 and issued on Oct. 11, 2005 as U.S. Pat. No. 6,954,097, which application is a continuation of U.S. application Ser. No. 08/879,847, filed Jun. 20, 1997 and issued on Jan. 9, 2001 as U.S. Pat. No. 6,173,432, the disclosures of which are incorporated herein by reference. This application is related to U.S. Ser. No. 12/178,442, which was filed on Jul. 23, 2008.

TECHNICAL FIELD

This invention relates to generating a sequence of accurately phased clock signals, and more particularly, to using delay and phase locked loops to provide a sequence of clock signals that are accurately phased relative to a master clock signal.

BACKGROUND OF THE INVENTION

Clock signals are used by a wide variety of digital circuits to control the timing of various events occurring during the operation of the digital circuits. For example, clock signals are used to designate when commands and other signals used in computer systems are valid and can thus be used to control the operation of the computer system. A clock signal can then be used to latch the command or other signals so that they can be used after the command or other signals are no longer valid.

The problem of accurately controlling the timing of clock signals for high speed digital circuits is exemplified by clock signals used in high speed dynamic random access memories (“DRAMs”) although the problem is, of course, also applicable to other digital circuits. Initially, DRAMs were asynchronous and thus did not operate at the speed of an external clock. However, since asynchronous DRAMs often operated significantly slower than the clock frequency of processors that interfaced with the DRAM, “wait states” were often required to halt the processor until the DRAM had completed a memory transfer. The operating speed of asynchronous DRAMs was successfully increased through such innovations as burst and page mode DRAMs which did not require that an address be provided to the DRAM for each memory access. More recently, synchronous dynamic random access memories (“SDRAMs”) have been developed to allow the pipelined transfer of data at the clock speed of the motherboard. However, even SDRAMs are incapable of operating at the clock speed of currently available processors. Thus, SDRAMs cannot be connected directly to the processor bus, but instead must interface with the processor bus through a memory controller, bus bridge, or similar device. The disparity between the operating speed of the processor and the operating speed of SDRAMs continues to limit the speed at which processors may complete operations requiring access to system memory.

A solution to this operating speed disparity has been proposed in the form of a computer architecture known as “SyncLink.” In the SyncLink architecture, the system memory is coupled to the processor directly through the processor bus. Rather than requiring that separate address and control signals be provided to the system memory, SyncLink memory devices receive command packets that include both control and address information. The SyncLink memory device then outputs or receives data on a data bus that is coupled directly to the data bus portion of the processor bus.

An example of a packetized memory device using the SyncLink architecture is shown in FIG. 1. The SyncLink memory device 10 includes a clock divider and delay circuit 40 that receives a master or command clock signal on line 42 and generates a large number of other clock and timing signals to control the timing of various operations in the memory device 10. The memory device 10 also includes a command buffer 46 and an address capture circuit 48 which receive an internal clock signal ICLK, a command packet CA0-CA9 on a command bus 50, and a FLAG signal on line 52. As explained above, the command packet contains control and address data for each memory transfer, and the FLAG signal identifies the start of a command packet which may include more than one 10-bit packet word. In fact, a command packet is generally in the form of a sequence of 10-bit packet words on the 10-bit command bus 50. The command buffer 46 receives the command packet from the bus 50, and compares at least a portion of the command packet to identifying data from an ID register 56 to determine if the command packet is directed to the memory device 10 or some other memory device 10 in a computer system. If the command buffer determines that the command is directed to the memory device 10, it then provides a command word to a command decoder and sequencer 60. The command decoder and sequencer 60 generates a large number of internal control signals to control the operation of the memory device 10 during a memory transfer.

The address capture circuit 48 also receives the command words from the command bus 50 and outputs a 20-bit address corresponding to the address data in the command. The address is provided to an address sequencer 64 which generates a corresponding 3-bit bank address on bus 66, a 10-bit row address on bus 68, and a 7-bit column address on bus 70.

One of the problems of conventional DRAMs is their relatively low speed resulting from the time required to precharge and equilibrate circuitry in the DRAM array. The packetized memory device 10 shown in FIG. 1 largely avoids this problem by using a plurality of memory banks 80, in this case eight memory banks 80 a-h. After a memory read from one bank 80 a, the bank 80 a can be precharged while the remaining banks 80 b-h are being accessed. Each of the memory banks 80 a-h receive a row address from a respective row latch/decoder/driver 82 a-h. All of the row latch/decoder/drivers 82 a-h receive the same row address from a predecoder 84 which, in turn, receives a row address from either a row address register 86 or a refresh counter 88 as determined by a multiplexer 90. However, only one of the row latch/decoder/drivers 82 a-h is active at any one time as determined by bank control logic 94 as a function of bank data from a bank address register 96.

The column address on bus 70 is applied to a column latch/decoder 100 which, in turn, supplies I/O gating signals to an I/O gating circuit 102. The I/O gating circuit 102 interfaces with columns of the memory banks 80 a-h through sense amplifiers 104. Data is coupled to or from the memory banks 80 a-h through the sense amplifiers 104 and I/O gating circuit 102 to a data path subsystem 108 which includes a read data path 110 and a write data path 112. The read data path 110 includes a read latch 120 receiving and storing data from the I/O gating circuit 102. In the memory device 10 shown in FIG. 1, 64 bits of data are applied to and stored in the read latch 120. The read latch then provides four 16-bit data words to a multiplexer 122. The multiplexer 122 sequentially applies each of the 16-bit data words to a read FIFO buffer 124. Successive 16-bit data words are clocked through the FIFO buffer 124 by a clock signal generated from an internal clock by a programmable delay circuit 126. The FIFO buffer 124 sequentially applies the 16-bit words and two clock signals (a clock signal and a quadrature clock signal) to a driver circuit 128 which, in turn, applies the 16-bit data words to a data bus 130. The driver circuit 128 also applies the clock signals to a clock bus 132 so that a device, such as a processor, reading the data on the data bus 130 can be synchronized with the data.

The write data path 112 includes a receiver buffer 140 coupled to the data bus 130. The receiver buffer 140 sequentially applies 16-bit words from the data bus 130 to four input registers 142, each of which is selectively enabled by a signal from a clock generator circuit 144. Thus, the input registers 142 sequentially store four 16-bit data words and combine them into one 64-bit data word applied to a write FIFO buffer 148. The write FIFO buffer 148 is clocked by a signal from the clock generator 144 and an internal write clock WCLK to sequentially apply 64-bit write data to a write latch and driver 150. The write latch and driver 150 applies the 64-bit write data to one of the memory banks 80 a-h through the I/O gating circuit 102 and the sense amplifier 104.

As mentioned above, an important goal of the SyncLink architecture is to allow data transfer between a processor and a memory device to occur at a significantly faster rate. However, the operating rate of a packetized DRAM, including the packetized memory device 10 shown in FIG. 1, is limited by the time required to receive and process command packets applied to the memory device 10. More specifically, not only must the command packets be received and stored, but they must also be decoded and used to generate a wide variety of signals. However, in order for the memory device 10 to operate at a very high speed, the command packets must be applied to the memory device 10 at a correspondingly high speed. As the operating speed of the memory device 10 increases, the command packets are provided to the memory device 10 at a rate that can exceed the rate at which the command buffer 46 can process or even store the command packets.

One of the limiting factors in the speed at which the command buffer 46 can store and process the command packets is control of the relative timing between the command packets and the clock signal ICLK. Both the command data signals and the ICLK signal are delayed relative to receipt of the command packet on the command bus 50 and the master clock signal on line 42. Furthermore, the amount of the delay is highly variable, and it is difficult to control. If the delay of the internal clock signal ICLK cannot be precisely controlled, it may cause the latch in the command buffer 48 to latch invalid command data signals. Thus, the speed at which command packets can be applied to the memory device 10 is limited by the delays in the memory device 10. Similar problems exist for other control signals in the memory device 10 which control the operation of the memory device 10 during each clock cycle.

Although the foregoing discussion is directed to the need for faster command buffers in packetized DRAMs, similar problems exist in other memory devices, such as asynchronous DRAMs and synchronous DRAMs, which must process control and other signals at a high rate of speed. Thus, for the reasons explained above, the limited operating speed of conventional command buffers threatens to severely limit the maximum operating speed of memory devices, particularly packetized DRAMs. Therefore, there is a need to precisely control the timing of clock signals relative to other signals, such as command packets applied to a command buffer in a packetized DRAM.

SUMMARY OF THE INVENTION

A clock generator circuit is used to provide a sequence of clock signals that have predetermined phases relative to a master clock signal. The clock generator circuit includes a first locked loop generating the sequence of clock signals which are increasingly delayed from a first clock signal to a last clock signal. Two of the clock signals, preferably the first and last clock signals, are locked to each other, such as by using a delay-locked loop, so that they have a predetermined phase with respect to each other. A second lock loop, which may also be a delay-locked loop, locks one of the clock signals in the sequence to the master clock signal so that the clock signals have respective phases with respect to the master clock signal. Where delay-locked loops are used, the first delay lock loop delay may lock the first clock signal and the last clock signal so that they are the inverse of each other. As a result, the first and last clock signals have respective phases that are 180 degrees from each other. Similarly, the second delay-lock loop may delay lock the first clock signals to the master clock signal so that they have substantially the same phase. The first delay-locked loop preferably includes a first voltage controlled delay circuit and a first phase detector. The first voltage controlled delay circuit generates the sequence of clock signals from the reference clock signal by delaying the reference clock signal by respective delays that are a function of a first control signal. A first phase detector compares the phase of the first and last clock signals and generates the first control signal as a function of the difference therebetween. Likewise, the second delay-locked loop preferably includes a second voltage controlled delay circuit and a second phase detector. The second voltage controlled delay circuit receives the master clock signal and generates a reference clock signal having a delay relative to the master clock signal that is a function of a second control signal. The second phase detector compares the phase of the master clock signal to the phase of the first clock signal and generating the second control signal as a function of the difference therebetween. A multiplexer may be coupled to the first delay-lock loop to couple one of the clock signals to a clock output terminal for use, for example, to latch command data in packetized DRAM.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a conventional packetized dynamic random access memory (“DRAM”) that may use a clock generator in accordance with an embodiment of the present invention.

FIG. 2 is a block diagram illustrating the manner in which an embodiment of clock generator in accordance with present invention may be used in a command latch in the packetized DRAM of FIG. 1.

FIG. 3 is a more detailed block diagram and logic diagram of the command latch of FIG. 2 using an embodiment of a clock generator in accordance with present invention.

FIG. 4 is a timing diagram showing many of the waveforms present in the command latch of FIG. 3.

FIG. 5 is a block diagram of a computer system using a plurality of DRAMs, each of which include the command latch of FIG. 3.

DETAILED DESCRIPTION OF THE INVENTION

One embodiment of a command latch 200 using an embodiment of a clock generator 210 in accordance with present invention is illustrated in FIG. 2. The command latch 200 also includes a latch circuit 212 receiving a command data bit CMD DATA from a command packet on a line of the command bus 50. The clock generator 210 receives a master or command clock signal CMD CLK on line 42 and generates the internal clock signal ICLK which is applied to the clock input of the latch circuit 212. The output of the latch circuit 212 is coupled to a variety of circuits in the memory device 10, including a select circuit 214. As explained below, the clock generator 210 produces a sequence of clock signals each having an increased delay relative to the leading edge of the master clock. One of the clock signals in the sequence is selected for use as the internal clock signal ICLK for clocking the latch circuit 212. The clock signal in the sequence for use as ICLK is selected by a multi-bit select word SELECT generated by the select circuit 214. Basically, the select circuit 214 determines which of the clock signals in the sequence has the proper timing to match the delay of the command data from the command bus 50 to the input of the latch circuit 212. The select circuit 214 then applies an appropriate SELECT word to the clock generator 210 to use the selected clock signal in the sequence as the internal clock signal ICLK.

A variety of designs may be used for the select circuit 214, as will be apparent to one skilled in the art. For example, a plurality of very short logic “1” pulses may be applied to the command bus line 50 in synchronism with the command clock signal CMD CLK. The select circuit 214 can then select each of the clock signals in the sequence and determine which clock signal(s) are able to capture the logic “1” pulses. If several clock signals are successful in capturing the logic “1” pulses, then the clock signal occurring midway between the successful clock signals can be used as the internal clock signal ICLK.

The command latch 200, including one embodiment of a clock generator 210 in accordance with the present invention, is shown in greater detail in FIG. 3. The latch 212 is illustrated in FIG. 3 as consisting of a receiver buffer 240 receiving a respective one of 10 bits of the command data CMD DATA. The output of the receiver buffer 240 is applied to the data input D of a latch circuit 242. The latch circuit 242 latches or stores the logic level applied to its D input whenever a clock signal applied to its clock C input goes high. The stored logic level is then continuously applied to the output of the latch circuit 242. Although a single receiver buffer 240 and latch circuit 242 are shown in FIG. 3, it will be understood that there are actually 10 receiver buffers 240 and latch circuits 242 to store the 10 bits of the command data on the command bus 50.

As explained above, it is very difficult to clock the latch circuit 242 at the proper time at the maximum speed of the command latch 200 because delays of the CMD DATA from the command bus 50 to the latch circuit 242 may not be equal to delays of the CMD CLK from the line 42 to the clock input of the latch circuit 242. Also, of course, unequal delays external to an integrated circuit containing the command latch 200 may cause the CMD CLK and the CMD DATA to be applied to the integrated circuit at different times. The function of the clock generator 210 is to provide an internal clock signal ICLK to the clock input of the latch circuit 242 that is capable of storing a command data bit at even the fastest operating speed of the command latch 200 despite any unequal internal or external delays that would cause the command bit and ICLK to be coupled to the latch circuit 242 at different times. However, it will be understood that the clock generator 210 may be used for other purposes both in dynamic random access memories and in other circuits.

The master or command clock CMD CLK is coupled from line 42 through a receiver buffer 250 substantially identical to the receiver buffer 240 coupling a command data bit to the latch circuit 242. The output of the receiver buffer 250 is applied to a conventional voltage controlled delay circuit 252 and to one input of a phase detector 254. The voltage controlled delay circuit 252 couples the output of the receiver buffer 250 to an output line 256 with a delay that is a function of a control signal applied to the delay circuit 252 on line 258. Although the control signal on line 258 is an analog voltage, it will be understood that other types of control signals, including digital words, may alternatively be used. The output of the voltage controlled delay circuit 252 is applied to a multi-tap voltage controlled delay line 260.

The multi-tap voltage controlled delay line 260 couples the clock signal applied to its input on line 256 to a plurality of output lines 264 a-264 n. The incoming clock signal is coupled to the output lines 264 with an increasing delay from the first line 264 a to the last line 264 n. In the embodiment illustrated in FIG. 3, there are 17 output lines 264, but the delay line 260 may have a greater or lesser number of output lines 264. When a delay locked loop that includes the delay line 260 is locked as explained below, the signals at the first output line 264 a and the last or 17th output line 264 n are the inverse of each other, i.e., phased 180 degrees from each other. Thus, the signals on the 17 lines are delayed by 11.25 degrees more than the signal coupled to the previous line 264. Thus, the first line 264 a has a relative phase of zero degrees, the 16th line 264 n−1 has a phase of 168.75 degrees and the last line 264 n has a phase of 180 degrees. More specifically, a control voltage applied to the delay line 260 through line 270 is adjusted so that the phase of the signal on the last line 264 n relative to the phase on the first line 264 a is 180 degrees. This is accomplished by applying the first line 264 a and the last line 264 n to respective inputs of a phase detector 272.

As mentioned above, the delay line 260 and phase detector 272 implement a first delay locked loop. When the first delay locked loop is locked, the signal on line 264 n will have a phase relative to the phase of the signal on line 264 a of 180 degrees. Therefore, as mentioned above, the signal on each of the output lines 264 a-264 n will sequentially increase from zero degrees to 180 degrees. Although the signals on lines 264 a-n are equally phased apart from each other, it will be understood that equal phasing is not required.

The clock generator 210 also includes a second delay locked loop formed by the phase detector 254, the voltage controlled delay circuit 252 and the voltage controlled delay line 260. More particularly, the last output line 264 n of the delay line 260 is applied through a simulated multiplexer circuit 290 and a clock driver 292 to one input of the phase detector 254. It will be recalled that the other input of the phase detector 254 receives the output of the receiver buffer 250. Like the phase detector 272, when the second delay locked loop is locked, the signals applied to the phase detector 254 are the inverse of each other. Thus, when the second loop is locked, the phase of the signal at the output of the clock driver 292 is 540 degrees (effectively 180 degrees) relative to the phase of the signal at the output of the receiver buffer 250.

The remaining output lines 264 a-264 n−1 of the delay line 260 are coupled to a multiplexer 310 having a plurality of output lines coupled to respective clock drivers 312 a-n. The multiplexer 310 couples the input of each of the clock drivers 312 a-n to any one of the output lines 264 a-264 n−1 as determined by respective select words SELECT. The clock driver 312 a is used to generate the internal clock signal ICLK which is coupled to the clock input of the latch circuit 242. The other clock drivers 312 b-n are used to couple various clock outputs from the delay line to other circuits in the memory device (not shown).

The phase detectors 254, 272 are each implemented using to a phase detector circuit 330, a charge pump 332 and a capacitor 334. However, other varieties of phase detectors may alternatively be used.

The phase detector circuit 330 applies either an increase signal on line 336 or a decrease signal on line 338 to respective inputs of the charge pump 332. The phase detector circuit 330 generates the increase signal on line 336 whenever the phase of a first signal on one of its inputs relative to a second signal on the other of its inputs is less than 180 degrees. As explained below, the increase signal on line 336 causes the charge pump 332 to adjust the control voltage to increase the delay of the first signal so that the phase of the first signal relative to the phase of the second signal approaches 180 degrees. The phase detector circuit 330 generates the decrease signal on line 338, in the opposite condition, i.e., when the phase of the second signal relative to the first signal is greater than 180 degrees. The decrease signal on line 338 causes the charge pump 332 to adjust the control voltage to reduce the delay of second signal toward 180 degrees.

Although the phase detector circuit 330 may be implemented in a variety of ways, it may simply use two set-reset flip-flops (not shown) for generating the increase and decrease signals, respectively. The increase flip-flop is set by the rising edge of the first signal on one of the inputs and reset by the falling edge of the second signal on the other input. Thus, the duration that the flip-flop is set, and hence the duration of the increase signal on line 336, corresponds to the period of time that the second signal must be further delayed to have a phase of 180 degrees relative to the phase of the first signal. Similarly, the flip-flop producing the decrease signal on line 338 is set by the falling edge of the second signal and reset by the rising edge of the first signal so that the duration of the decrease signal on line 338 corresponds to the time that the second signal is delayed beyond the time that it would have a phase of 180 degrees relative to the phase of the first signal.

There are also a variety of approaches for implementing the charge pump 332. However, the charge pump 332 can be implemented by simply applying a constant current to the capacitor 334 for the duration of each increase signal on line 336 and removing a constant current from the capacitor 334 for the duration of each decrease signal on line 338. Appropriate circuitry could also be included in either the phase detector circuit 330 or the charge pump 332 to provide hysteresis in a band when the first and second signals have relative phases of approximately 180 degrees from each other as will be apparent to one skilled in the art. The operation of the command latch 200 of FIG. 3 can best be explained with reference to the timing diagram of FIG. 4. As illustrated in FIG. 4, the command clock signal CMD CLK on line 42 is delayed by approximately 70 degrees in passing through the receiver buffer 250 to node A (FIG. 3). Assuming that both of the delay-lock loops are locked, the signal at the output of the receiver buffer 250 is delayed by 120 degrees in passing through the voltage controlled delay circuit 252 to node B. The signal on node B is then coupled to node C with a delay of another 120 degrees and to node D with a delay of 300 degrees so that the signals at nodes C and D are phased 180 degrees apart from each other. Since the signals at nodes C and D are compared to each other by the phase detector 272, the phase detector 272 adjusts the control voltage on line 270 to ensure that the signals at nodes C and D are phased 180 degrees from each other. The other outputs from the delay line 260 have phases relative to the phase of the signal at node C that increase 11.25 degrees for each output in sequence from the first line 264 a to the last line 264 n.

As mentioned above, one of the first 16 output lines 264 a-264 n−1 of the delay lines 260 is coupled through the multiplexer 310 and the clock driver 312 a to provide the internal clock signal ICLK at node E. In passing through the multiplexer 310 and the clock driver 312 a, the selected output from the delay line is delayed by another 120 degrees. Thus, the signal Eo coupled from the first output line of the delay line 260 is delayed by 120 degrees, the signal E4 from the fifth output is delayed by 165 degrees, the signal E8 from the ninth output is delayed by 210 degrees, the signal E12 from the 13th output is delayed by 255 degrees, and the signal E15 from the 16th output is delayed by 288.75 degrees. Although the output signals are coupled from the delay line 260 through the multiplexer 310 and clock driver 312 a with a delay, that delay is matched by the coupling of the signal from line 264 n through the simulated multiplexer 290 and clock driver 292 since the same circuit is used for the simulated multiplexer 290 as the multiplexer 310 and the clock driver 292 is identical to the clock driver 312 a. For this reason, and because the phase of the signal on line 264 n is 180 degrees relative to the phase of the signal on line 264 a, the signal at the output of the clock driver 292 at node G has a phase relative to the phase of the signal Eo at the output of the clock 31′a of 180 degrees. Since the signals applied to the inputs of the phase detector 254 are the inverse of each other when the delay-locked loop is locked, the signal Eo has substantially the same phase as the signal at the output of the receiver buffer 250. Furthermore, the delay of the voltage controlled delay circuit 252 will be adjusted so that the signal Eo always has the same phase as the command clock coupled to the output of the receiver buffer 250 at A. Assuming the CMD DATA is valid on the rising edge of the CMD CLK signal, the command data bit coupled to the latch circuit 242 is valid on the rising edge of ICLK since ICLK is properly phased to the signal at node A and the delay through the receiver buffer 250 is substantially the same as the delay through the receiver buffer 240.

In operation, the multiplexer 310 selects one of the outputs from the delay line 260 as determined by the SELECT signal so that the optimum clock signal between Eo and E15 (FIG. 4) will be used as the internal clock ICLK.

In summary, the “inner” delay locked loop formed by the phase detector 272 and the voltage controlled delay circuit 260 generates a sequence of signals that have increasing phases from zero to 180 degrees. The “outer” delay locked loop formed by the phase detector 254, the voltage controlled delay circuit 252 and the delay line 260 align one of the clock signals in the sequence to the command clock. As a result, all of the clock signals at the output of the delay line 260 have respective predetermined phases relative to the phase of the command clock at node A.

Although the embodiment of the clock generator 210 illustrated in FIG. 3 uses delay-locked loops, it will be understood that other locked loop circuits, such as phase-locked loop circuits, may also be used. Other modifications will also be apparent to one skilled in the art.

A computer system using the command latch 200 of FIGS. 2 and 3 in each of a plurality of packetized DRAMs 10 of FIG. 1 is shown in FIG. 5. With reference to FIG. 5 the computer system 400 includes a processor 402 having a processor bus 404 coupled to three packetized dynamic random access memory or SyncLink DRAMs (“SLDRAM”) 10 a-c. The computer system 400 also includes one or more input devices 4100, such as a keypad or a mouse, coupled to the processor 402 through a bus bridge 412 and an expansion bus 414, such as an industry standard architecture (“ISA”) bus or a Peripheral component interconnect (“PCI”) bus. The input devices 410 allow an operator or an electronic device to input data to the computer system 400. One or more output devices 420 are coupled to the processor 402 to display or otherwise output data generated by the processor 402. The output devices 420 are coupled to the processor 402 through the expansion bus 414, bus bridge 412 and processor bus 404. Examples of output devices 420 include printers and video display units. One or more data storage devices 422 are coupled to the processor 402 through the processor bus 404, bus bridge 412, and expansion bus 414 to store data in or retrieve data from storage media (not shown). Examples of storage devices 422 and storage media include fixed disk drives floppy disk drives, tape cassettes and compact-disk read-only memory drives.

In operation, the processor 402 communicates with the memory devices 10 a-c via the processor bus 404 by sending the memory devices 10 a-c command packets that contain both control and address information. Data is coupled between the processor 402 and the memory devices 10 a-c, through a data bus portion of the processor bus 404. Although all the memory devices 10 a-c are coupled to the same conductors of the processor bus 404, only one memory device 10 a-c at a time reads or writes data, thus avoiding bus contention on the processor bus 404. Bus contention is avoided by each of the memory devices 10 a-c and the bus bridge 412 having a unique identifier, and the command packet contains an identifying code that selects only one of these components.

The computer system 400 also includes a number of other components and signal lines which have been omitted from FIG. 5 in the interests of brevity. For example, as explained above, the memory devices 10 a-c also receive a command or master clock signal to provide internal timing signals, a data clock signal clocking data into and out of the memory device 16, and a FLAG signal signifying the start of a command packet.

From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims. 

The invention claimed is:
 1. A computer system, comprising: a processor having a processor bus; an input device coupled to the processor through the processor bus adapted to allow data to be entered into the computer system; an output device coupled to the processor through the processor bus adapted to allow data to be output from the computer system; and a memory coupled to the processor bus adapted to allow data to be stored, the dynamic random access memory comprising: at least one array of memory cells adapted to store data at a location determined by a row address and a column address responsive to a command word; a row address circuit adapted to receive and decode the row address, and select a row of memory cells corresponding to the row address responsive to the command word; a column address circuit adapted to receive or apply data to one of the memory cells in the selected row corresponding to the column address responsive to the command word; a data path circuit adapted to couple data between an external terminal and the column address circuit responsive to the command word; and a command data latch circuit for storing a command data packet at a time determined from a command clock signal, the command data latch comprising: a latch circuit configured to receive the command data and receive a selected internal clock signal appropriate to latch the command data, the latch circuit operable to latch the command data in accordance with the selected internal clock signal; a clock generator configured to receive a master clock signal based on the command clock signal, the clock generator operable to generate a plurality of internal clock signals, including the selected internal clock signal and a reference clock signal, each internal clock signal having a different phase relative to the command clock signal and the reference clock signal having a phase relative to the command clock signal, the clock generator further configured to receive a select signal, the clock generator operable to couple the selected internal clock signal to the latch circuit responsive to the select signal; a select circuit coupled to the latch circuit and the clock generator, the select circuit operable to identify the selected internal clock signal from the plurality of internal clock signals and generate the select signal for the clock generator; a first locked loop operable to lock the phase between a first one of the plurality of internal clock signals and the reference clock signal; and a second locked loop operable to lock the phase between the master clock signal and the reference clock signal.
 2. The computer system of claim 1 wherein at least one of the first and second locked loops comprises a phase-locked loop.
 3. The computer system of claim 1 wherein at least one of the first and second locked loops comprises a delay-locked loop.
 4. The computer system of claim 1 wherein the second locked loop comprises: a first signal generator configured to receive the master clock signal and a first control signal and operable to generate an intermediate clock signal having a predetermined phase relative to the master clock signal based on the first control signal; a second signal generator configured to receive the reference clock signal and a second control signal and operable to generate the plurality of internal clock signals and the reference clock signal; and wherein the second locked loop is operable to generate the first control signal based on a comparison of the master clock signal and the reference clock signals and the first locked loop is operable to generate the second control signal based on a comparison of the first of the plurality of internal clock signals and the reference clock signal.
 5. The computer system of claim 4 wherein the second locked loop further comprises: a first phase detector operable to compare the phase between the master clock signal and the reference clock signals; and a first charge pump coupled to the first phase detector, the first charge pump operable to provide the first control signal to the first signal generator.
 6. The computer system of claim 5 wherein the first locked loop comprises the second signal generator, a second phase detector operable to compare the phase between the first of the internal clock signals and the reference clock signal; and a second charge pump coupled to the second phase detector, the second charge pump operable to provide the second control signal to the second signal generator.
 7. The computer system of claim 4 wherein the charge pump further comprises: a multiplexer coupled to the second signal generator, the multiplexer operable to receive the selected internal clock signal responsive to the select signal; and a clock driver coupled to the multiplexer and the latch, the clock driver configured to receive the selected internal clock signal from the multiplexer and operable to couple the selected internal clock signal to the latch, wherein the multiplexer and the clock driver delay the selected internal clock signal by a predetermined internal delay amount.
 8. The computer system of claim 4 wherein the command latch further comprises: a simulated multiplexer coupled to the second signal generator and configured to receive the reference clock signal; and a second clock driver coupled to the simulated multiplexer and the second locked loop, the second clock driver configured to receive one of the plurality of internal clock signals from the simulated multiplexer and operable to couple the one of the plurality of internal clock signals to the second locked loop, wherein the simulated multiplexer and the second clock driver delay the one of the plurality of internal clock signals the predetermined internal delay amount.
 9. The computer system of claim 1 wherein the second locked loop comprises: a first signal generator operable to produce the reference clock signal at an output terminal, the reference clock signal having a phase relative to the master clock signal that is a function of a first control signal applied to a first control terminal; and a first phase detector having a first input terminal coupled to the master clock signal, a second input terminal coupled to the reference clock signal, and an output terminal coupled to the first control terminal of the first signal generator, the first phase detector operable to generate the control signal as a function of the difference in phase between the master clock signal and the reference clock signal at the first and second input terminals, respectively; and wherein the first locked loop comprises: a second signal generator coupled to the reference clock signal from the first signal generator and operable to produce the plurality of internal clock signals, the phase of the internal clock signals relative to the reference clock signal being a function of a second control signal applied to a second control terminal; and a second phase detector having a first input terminal coupled to the reference clock signal from the first signal generator, a second input terminal coupled to one of the plurality of clock signals, and an output terminal coupled to the second control terminal of the second signal generator, the second phase detector operable to generate the control signal as a function of the difference in phase between the reference clock signal and the clock signal applied to the first and second input terminals, respectively.
 10. A computer system, comprising: a processor having a processor bus; an input device coupled to the processor through the processor bus adapted to allow data to be entered into the computer system; an output device coupled to the processor through the processor bus adapted to allow data to be output from the computer system; and a memory coupled to the processor bus adapted to allow data to be stored, the dynamic random access memory comprising: at least one array of memory cells adapted to store data at a location determined by a row address and a column address responsive to a command word; a row address circuit adapted to receive and decode the row address, and select a row of memory cells corresponding to the row address responsive to the command word; a column address circuit adapted to receive or apply data to one of the memory cells in the selected row corresponding to the column address responsive to the command word; a data path circuit adapted to couple data between an external terminal and the column address circuit responsive to the command word; and a command data latch circuit for storing a command data packet at a time determined from a command clock signal, the command data latch comprising: a latch circuit having a data input and a clock input, the data input adapted to receive the command data packet and the latch circuit operable to store the command data packet responsive to a clock signal applied to the clock input; a clock generator circuit operable to generate a sequence of clock signals from a periodic master clock signal, the sequence of clock signals increasingly delayed from a first clock signal to a last clock signal, the sequence of clock signals including the clock signal applied to the clock input of the latch circuit, a select circuit operable to determine which of the signals in the sequence of clock signals from the clock generator should be used to cause the latch circuit to store the command data packet and generate the select signal corresponding thereto; a first locked loop operable to generate the sequence of clock signals and lock two of the clock signals in the sequence such that they have a predetermined phase with respect to each other; and a second locked loop operable to lock one of the clock signals in the sequence of clock signals to the master clock signal so that the clock signals in the sequence have respective phases with respect to the master clock signal.
 11. The computer system of claim 10, wherein at least one of the first and second locked loops comprises a phase locked loop.
 12. The computer system of claim 10, wherein at least one of the first and second locked loops comprises a delay locked loop.
 13. The computer system of claim 10, further comprising: a first multiplexer coupled to the first locked loop, the first multiplexer operable to couple the selected clock signal to the clock input of the latch circuit, the multiplexer delaying the selected clock signal by a predetermined delay amount; and a second multiplexer coupled to the second locked loop, the second multiplexer operable to couple the one of the clock signals to the second locked loop such that the one of the clock signals is delayed by the predetermined delay amount.
 14. A computer system, comprising: a processor having a processor bus; an input device coupled to the processor through the processor bus adapted to allow data to be entered into the computer system; an output device coupled to the processor through the processor bus adapted to allow data to be output from the computer system; and a memory coupled to the processor bus adapted to allow data to be stored, the dynamic random access memory comprising: at least one array of memory cells adapted to store data at a location determined by a row address and a column address responsive to a command word; a row address circuit adapted to receive and decode the row address, and select a row of memory cells corresponding to the row address responsive to the command word; a column address circuit adapted to receive or apply data to one of the memory cells in the selected row corresponding to the column address responsive to the command word; a data path circuit adapted to couple data between an external terminal and the column address circuit responsive to the command word; and a command data latch circuit for storing a command data packet at a time determined from a command clock signal, the command data latch comprising: a clock generator configured to receive the command clock, the clock generator operable to generate a plurality of internal clock signals such that each of the plurality of internal clock signals has a predetermined phase relative to the phase of the command clock; a latch circuit configured to receive the command data and a selected clock signal of the sequence of internal clock signals, the latch circuit operable to latch the command data using the selected clock signal; a select circuit coupled to the clock generator and the latch circuit, the select circuit operable to determine the selected clock signal of the sequence of clock signals that will properly latch the command data; an inner locked loop operable to generate a sequence of signals having increasing predetermined phases relative to one another; and an outer locked loop operable to align one of the sequence of signals to the command clock signal, thereby generating the sequence of internal clock signals such that each of the internal clock signals has a different predetermined phase relative to the command clock signal.
 15. The computer system of claim 14, wherein at least one of the inner and outer locked loops comprises a phase locked loop.
 16. The computer system of claim 14, wherein at least one of the inner and outer locked loops comprises a delay locked loop.
 17. The computer system of claim 14, further comprising: a first multiplexer coupled to the signal generator and the latch circuit, the first multiplexer operable to receive the selected clock signal and couple the selected clock signal to the latch circuit, thereby delaying the selected clock signal by a predetermined delay amount; and a second multiplexer coupled to the inner locked loop and the outer locked loop, the second multiplexer operable to couple at least one of the sequence of signals from the inner locked loop to the outer locked loop, the simulated multiplexer delaying the at least one signal by the predetermined delay amount.
 18. The computer system of claim 14, wherein the one of the sequence of signals is coupled from the inner locked loop to the outer locked loop.
 19. The computer system of claim 18 further comprising a simulated multiplexer operable to couple the one of the sequence of signals from the inner locked loop to the outer locked loop such that the one of the sequence is delayed a predetermined amount determined by the delay associated with the latch circuit receiving the selected clock signal.
 20. The computer system of claim 19 further comprising a multiplexer operable to couple the selected clock signal to the latch circuit with a multiplexer delay, the simulated multiplexer configured to impose the multiplexer delay on the one of the sequence of signals.
 21. The computer system of claim 20 further comprising a first clock driver operable to couple the selected clock signal from the multiplexer to the latch circuit and a second clock driver operable to couple the one of the sequence of signals from the simulated multiplexer to the outer locked loop, the first and second clock drivers configured to impose the same delay on the selected clock signal and one of the sequence of signals, respectively.
 22. The computer system of claim 14, wherein the select circuit is operable to select each of the sequence of clock signals in sequence and determine which clock signal properly latches the command data. 